CDS

Accession Number TCMCG075C07718
gbkey CDS
Protein Id XP_007045533.2
Location complement(join(38079699..38080421,38080528..38081178))
Gene LOC18610038
GeneID 18610038
Organism Theobroma cacao
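
The Location field above can be checked against the protein length reported in this record. The following is a minimal sketch, assuming the location string uses the usual GenBank-style `complement(join(start..end,...))` syntax with 1-based inclusive coordinates; the helper names `parse_location` and `cds_length` are hypothetical and introduced only for illustration.

```python
import re

def parse_location(location: str):
    """Return (strand, [(start, end), ...]) for a GenBank-style location string."""
    # "complement(...)" marks a feature on the minus strand (assumed convention).
    strand = "-" if location.startswith("complement(") else "+"
    # Each "start..end" pair is one CDS segment with inclusive coordinates.
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments

def cds_length(segments):
    """Total coding length in nucleotides across all segments."""
    return sum(end - start + 1 for start, end in segments)

location = "complement(join(38079699..38080421,38080528..38081178))"
strand, segments = parse_location(location)
total_nt = cds_length(segments)
codons, remainder = divmod(total_nt, 3)

print(strand, segments)
print(total_nt, "nt =", codons, "codons, remainder", remainder)
print("residues excluding stop codon:", codons - 1)
```

The two segments are 723 nt and 651 nt long, giving 1374 nt, or 458 codons. Subtracting the stop codon leaves 457 residues, which agrees with the Length field of the Protein section.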

Protein

Length 457 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007045471.2
Definition PREDICTED: UDP-glycosyltransferase 74G1 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category CG
Description Belongs to the UDP-glycosyltransferase family
KEGG_TC -
KEGG_Module M00370
KEGG_Reaction R03213, R08164, R08668
KEGG_rclass RC00005, RC00882
BRITE ko00000, ko00001, ko00002, ko01000, ko01003
KEGG_ko ko:K11820, ko:K13691
EC 2.4.1.195
KEGG_Pathway ko00380, ko00966, ko01110, ko01210, map00380, map00966, map01110, map01210
GOs -

Sequence

CDS:  
ATGGCGCAGGAAAACAAGGCCAACAAAGGTCATGTTCTTGTCCTTCCGTATCCAGGCCAAGGCCACATCAACCCTATGCTTCAGTTCGCCAAACGCTTAGTCTCCAAAGGCGTCAAAGCCACACTGGTAACCACCATCTTCCTCTACAACTCCATTCTTTCCGACCCAACCAGCTCTATTGATCTCCAGACAATATCTGATGGCTTCGACGAAGGTGGTTTTGCTCAGGCCGGAAGCCCTGATGCTTACCTGTCAACTTTCCGGTCAGTAGGCTCGCAAAGTCTAGCAAGTTTAATCCAGAAACTTGGTGATACTGATTTTCCATTTGATGCCATTATCTATGATTCATTTCTGCCTTGGGCACTTGATGTTGCAAGGCAGTTAGGGTTACTCGGAGCAGTGTTTTTCACGCAGTCTTGTGCTGTTAACAGCATAAATTACCATGTGAGCGAGGGGCTTCTTAAGCTGCCACTTGAAGGACCTAATGTTTCCCTTCCTGGATTACCTCTATATAAAGTTTCCGAGCTGCCATCTGTGGTGTATCTTTATGGATCACACCCGGCCTGGTTTGATATGATCGTGAATCAATTCTCCAACATTGATGCAGCTGATTGGGTTCTGGTAAACACTTTTTACGAATTGGAGAAAGAGGTTGTAGATTGGATGTCAGAAATCTGGAAGCTAGGGACAGTAGGACCAACCATTCCATCCATGCACTTGGACAGAAGGCTAGAAGGTAACAAAGACTATGGCATGAATCTTTTCAACCCGAACACCAACACTTGCATGAATTGGCTAAATGGCAAGCCAAATGGCTCAGTGGTCTATGTCTCATTTGGGAGCTTGGCAGATCTAGGAGTCGAAGAAATGGCAGAGATAGCTTGGGGTTTGAAAGTAAGCAATTGCTATTTCTTGTGGGTAGTGAGGGAATCAGAAGAGACCAAGCTGCCATATAATTTCAAAGAGGAGACTGGGGAGAAGGGTTTGATGGTGGCATGGTGCCCTCAGTTGGAGGTGCTGGCACATGACTCTGTGGGATGCTTTCTTACTCATTGTGGCTATAATTCTGTTCTTGAAGCATTGTGTTTAGGGGTTCCAATGCTGGGAATGCCGCAATGGGCTGACCAAGCCACAAATGCAAAGCACGTTGAAGAGATTTGGGGAATTGGAATTAGAGCCTTCCCTGATGAGAAAGGTATTGTGAGAGGAGAGATTATACAACAGTGCATAAAGGAACTAATGGAAGGAGAAAGGGGCAAACAGGTTAAGGAGAATGCAAACAAGTGGAAGAATTTGGCTAGAGACGCAACTGATGAAGGTGGAAGTTCAGATAAGAACATTGACGAATTTGTGGCTAAACTACTCCATGCCTAG
Protein:  
MAQENKANKGHVLVLPYPGQGHINPMLQFAKRLVSKGVKATLVTTIFLYNSILSDPTSSIDLQTISDGFDEGGFAQAGSPDAYLSTFRSVGSQSLASLIQKLGDTDFPFDAIIYDSFLPWALDVARQLGLLGAVFFTQSCAVNSINYHVSEGLLKLPLEGPNVSLPGLPLYKVSELPSVVYLYGSHPAWFDMIVNQFSNIDAADWVLVNTFYELEKEVVDWMSEIWKLGTVGPTIPSMHLDRRLEGNKDYGMNLFNPNTNTCMNWLNGKPNGSVVYVSFGSLADLGVEEMAEIAWGLKVSNCYFLWVVRESEETKLPYNFKEETGEKGLMVAWCPQLEVLAHDSVGCFLTHCGYNSVLEALCLGVPMLGMPQWADQATNAKHVEEIWGIGIRAFPDEKGIVRGEIIQQCIKELMEGERGKQVKENANKWKNLARDATDEGGSSDKNIDEFVAKLLHA
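
The relationship between the CDS and Protein sequences above can be spot-checked by translating the nucleotide sequence. This is a rough sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the function name `translate` and the variables `cds_start` and `cds_end` are hypothetical, and only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code, laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # "X" for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCGCAGGAAAACAAGGCCAACAAAGGT"  # first 30 nt of the CDS above
cds_end = "CTACTCCATGCCTAG"                   # last 15 nt of the CDS above

print(translate(cds_start))  # MAQENKANKG
print(translate(cds_end))    # LLHA
```

Both fragments match the corresponding ends of the Protein sequence (`MAQENKANKG...` and `...LLHA`), and the final codon `TAG` is a stop codon that is not represented in the protein.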